Analysis of Asymmetric Measures for Performance Estimation of a Sentiment Classifier
نویسندگان
چکیده
The development of a sentiment classifier experiences two problems to cope with: the demand of large amounts of labelled training data and a decrease in performance when the classifier is applied to a different domain. In this paper, we attempt to address this problem by exploring a number of metrics that try to predict the cross-domain performance of a sentiment classifier through the analysis of divergence between several probability distributions. In particular, we apply similarity measures to compare different domains and investigate the implications of using non-symmetric measures for contrasting feature distributions. We find that quantifying the difference between domains is useful to predict which domain has a feature distribution most similar to the target domain.
منابع مشابه
یک چارچوب نیمهنظارتی مبتنی بر لغتنامه وفقی خودساخت جهت تحلیل نظرات فارسی
With the appearance of Web 2.0 and 3.0, users’ contribution to WWW has created a huge amount of valuable expressed opinions. Considering the difficulty or impossibility of manually analyzing such big data, sentiment analysis, as a branch of natural language processing, has been highly considered. Despite the other (popular) languages, a limited number of research studies have been conducted in ...
متن کاملMeasuring Term Specificity Information for Assessing Sentiment Orientation of Documents in a Bayesian Learning Framework
The assessment of document sentiment orientation using term specificity information is advocated in this study. An interpretation of the mathematical meaning of term specificity information is given based on Shannon’s entropy. A general form of a specificity measure is introduced in terms of the interpretation. Sentiment classification using the specificity measures is proposed within a Bayesia...
متن کاملAutomated Tumor Segmentation Based on Hidden Markov Classifier using Singular Value Decomposition Feature Extraction in Brain MR images
ntroduction: Diagnosing brain tumor is not always easy for doctors, and existence of an assistant that facilitates the interpretation process is an asset in the clinic. Computer vision techniques are devised to aid the clinic in detecting tumors based on a database of tumor c...
متن کاملA Grouping Hotel Recommender System Based on Deep Learning and Sentiment Analysis
Recommender systems are important tools for users to identify their preferred items and for businesses to improve their products and services. In recent years, the use of online services for selection and reservation of hotels have witnessed a booming growth. Customer’ reviews have replaced the word of mouth marketing, but searching hotels based on user priorities is more time-consuming. This s...
متن کاملSentiment analysis methods in Sentiment analysis methods in Persian text: A survey
With the explosive growth of social media such as Twitter, reviews on e-commerce website, and comments on news websites, individuals and organizations are increasingly using opinions in these media for their decision making. Sentiment analysis is one of the techniques used to analyze userschr('39') opinions in recent years. Persian language has specific features and thereby requires unique meth...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Research in Computing Science
دوره 65 شماره
صفحات -
تاریخ انتشار 2013